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ABSTRACT 

The determination of the density parameter r^o from the large-scale distribution of 
galaxies is one of the major goals of modern cosmology. However, if galaxies are bi- 
ased tracers of the underlying mass distribution, linear perturbation theory leads to 
a degeneracy between D,q and the linear bias parameter b, and the density parameter 
cannot be estimated. In Matarrese, Verde & Heavens (1997) we developed a method 
based on second-order perturbation theory to use the bispectrum to lift this degener- 
acy by measuring the bias parameter in an fio-independent way. The formalism was 
developed assuming that one has perfect information on the positions of galaxies in 
three dimensions. In galaxy redshift surveys, the three-dimensional information is im- 
perfect, because of the contaminating effects of peculiar velocities, and the resulting 
clustering pattern in redshift space is distorted. In this paper, we combine second- 
order perturbation theory with a model for collapsed, virialised structures, to extend 
the method to redshift space, and demonstrate that the method should be success- 
ful in determining with reasonable accuracy the bias parameter from statc-of-thc-art 
surveys such as the Anglo-Australian 2 degree field survey and the Sloan digital sky 
survey. 

Key words: cosmology: theory - galaxies: clustering and redshift - galaxies: bias - 
large-scale structure of Universe 
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1 INTRODUCTION 

One of the most important challenges of observational cos- 
mology today is to measure the density parameter flo, that 
plays a key role in the determination of the properties and 
the geometry of the Universe. Geometrical tests such as 
number counts or the apparent magnitude as a function 
of the redshift of standard candles suffer from serious un- 
certainties, being sensitive to possible luminosity evolution 
with redshift and to the effect of a possible cosmological 
constant. This motivates attempting to measure Qq in the 
local Universe from large-scale structure (LSS). Indeed, the 
only strong suggestions that the Universe may be close to 
the critical density comes from peculiar velocities or redshift 
distortions studies of LSS. These studies have the advan- 
tage that evolution and geometrical effects are negligible, 
but have some drawbacks. The amplitude of the distortions 
on linear scales allows measurement of the density param- 
eter only in the degenerate combination f3 = Qq'^ /b where 
b is the linear bias parameter (e.g. Kaiser 1987; Hamilton 
1992; Fisher, Scharf & Lahav 1994; Heavens & Taylor 1995; 



Ballinger, Heavens & Taylor 1995; Cole, Fisher & Weinberg 
1995). The current values for /3 fall roughly in the range 0.5-1 
depending on the measurement method used and on the cat- 
alogue selection criteria (Strauss & Willick 1995; Hamilton 
1997). Even if /3 = 0.5, flo could still be close to its critical 
value if dark matter is poorly traced by luminous objects. 
The determination of the density parameter from linear pe- 
culiar velocities or redshift space distortion studies is there- 
fore compromised by the lack of a good measurement of the 
bias parameter. This motivates methods which attempt to 
measure Qo directly (Dekel & Rees 1994; Fan, Bahcall & 
Cen 1997) or methods to determine 6, which then allows 
determination of Qq from l3. In this paper, we pursue the 
latter approach: there is a relative bias between IRAS and 
optically selected galaxies (e.g. Peacock 1997), so at least 
one of the galaxy types must be at best a biased tracer of 
the underlying mass distribution. In practice, we have very 
little idea of how bias works and evolves, and there is there- 
fore strong motivation to measure it empirically from the 
galaxy distribution. Future Cosmic Microwave Background 
experiments may allow the simultaneous determination of 
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all the main cosmological parameters with unprecedented 
accuracies (Jungman et al. 1996; Bersanelli et al. 1996), in 
which case the major motivation of studies like this will be 
to constrain galaxy formation mechanisms and the power 
spectrum by measuring b directly, rather than, as here, on 
using it as a route to Slo- 

In a previous paper (Matarrese et al. 1997; hereafter 
MVH97) we presented a method to measure the bias param- 
eter from LSS data by studying higher-order characteristics 
of the density field. We developed an idea of Fry (1994): 
since the degeneracy between b and Jlo is an intrinsic fea- 
ture of linear theory, one needs to go to second order to 
separate the parameters. Under the assumption that the ini- 
tial fluctuation field is Gaussian and that structures grow by 
gravitational instability, the three-point correlation function 
and its counterpart in Fourier space, the bispectrum, arc in- 
trinsically second order quantities. The bispectrum has the 
advantage that the range of validity of perturbation theory 
appears well-defined (Scoccimarro et al. 1998) - if the power 
spectrum is roughly known - and in the appropriate range, 
the correlation properties of the estimators can be calcu- 
lated. If the bias is local, this allows us to estimate it via a 
likelihood method, and therefore to assign an errorbar. For 
consistency with second order perturbation theory (20PT) 
the first non-linear bias term must also be considered, and 
this can introduce an additional degeneracy between skew- 
ness induced by gravitational evolution and that introduced 
by bias. The two effects can, however, be separated by the 
use of shape information: essentially non-linear biasing of a 
truly Gaussian field will lead to different shaped structures 
from a non-Gaussian field arising from gravitational instabil- 
ity. Since the signal comes from the mildly non-linear regime, 
the bias can be measured on scales where A^(the conven- 
tional variance per unit logarithmic interval) is about unity. 
In MVH97 we showed that the bispectrum method we pre- 
sented succeeds in recovering the true value for the bias with 
a reasonable error in a very idealized case where the posi- 
tions of the particles were known in real space. In reality, 
the best distance indicator for large galaxy catalogues is 
redshift; the main purpose of this paper is to show how the 
bispectrum in redshift-space can be used to estimate bias. 

Galaxy catalogues use the redshift as third spatial co- 
ordinate. In a perfectly homogeneous Friedman Universe, 
redshift would be an accurate distance indicator, but inho- 
mogeneities perturb the Hubble flow, and introduce peculiar 
velocities. The resulting redshift-space map of the galaxy 
distribution is thus distorted. As Kaiser (1987) pointed out, 
peculiar velocities distort the clustering pattern in redshift 
space on all scales, but the effect can be regarded as being 
split into two components: a large-scale distortion result- 
ing from coherent inflow into over-dense regions, and the 
'Fingers-of-God' (FoG) arising from virialised, highly non- 
linear structures. For the bispectrum, we find distortions in 
redshift-space arising in 20PT (cf. Hivon et al. 1994), but 
also significant effects from virialised structures. In this pa- 
per, we incorporate the effects of virialised structures by a 
method which has been successfully used with the redshift- 
space power spectrum: we model the effects as an incoher- 
ent velocity dispersion. Once again the shape information 
allows the disentanglement of redshift space distortions and 
other effects: redshift-space mapping modifies the shape of 



the structures in a different way from gravitational evolution 
and biasing. 

The next generation of galaxy surveys, like for example 
the Anglo-Australian 2-degree Field (2dF, Colless 1996) and 
Sloan Digital Sky Survey (SDSS, Gurm & Weinberg 1995), 
will allow not only accurate measurement of P, but also es- 
timation of the bias parameter with an accuracy of a few 
percent. 

This paper is principally concerned with the treatment 
of redshift distortions; a subsequent paper will present a 
detailed study of the other issues. 

The plan of this paper is as follows. In Section 2 we 
quantify the effect that redshift distortions have on the bis- 
pectrum, and, as a consequence, on the determination of 
the bias parameter. Then we present a second-order per- 
turbation description for the effect of redshift distortions 
on the bispectrum and on the covariance matrix needed for 
the likelihood analysis. Also we discuss how to include the 
modelling of small scale, highly non-linear effects, such as 
FoG: non-linear dynamics and virialised structures in red- 
shift space contaminates scales that in real space would be 
only mildly non-linear. Section 3 contains the practical im- 
plementation of the method in a numerical sirrmlation of the 
redshift space galaxy distribution. In Section 4 we discuss 
the applicability of the method in more general terms: we im- 
plement the analysis on a open Universe simulation and on a 
biased distribution in real space. The results obtained from 
the unbiased and the biased distributions are also shown. 
Section 5 discusses the results also in the prospect of the 
next generation of large galaxy redshift surveys. 



2 MODELLING OF REDSHIFT DISTORTIONS 

It is worth making a few remarks about performing this sort 
of analysis in Fourier space. Fourier space analysis has the 
great advantage that there is a clear separation of scales 
where perturbation theory works and breaks down. Given 
the nature of the redshift distortion however, the natural 
way would be to decompose the density field in spherical 
harmonics and spherical Bessel functions (Heavens & Tay- 
lor 1995). But, we showed in MVH97, rather than using 
the survey as a whole, it is more effective to split it into 
subsamples. For a deep survey in the individual subsamples 
the Fourier decomposition and Kaiser's (1987) distant ob- 
server approximation to model even large-scale distortions 
is a good description. 

2.1 Eflfects of redshift distortions on the power 
spectrum and on the bispectrum 

As a preliminary, in this section we compute the effects of 
redshift distortion on a particularly simple class of bispec- 
tra, those whose wavevectors form an equilateral triangle 
and averaging over orientation. This analysis is not required 
in later sections, but serves to illustrate the effects of per- 
turbation theory and the virialisation model. 

On large (linear) scales the redshift space effect on an 
individual Fourier component of the density fluctuation fleld 
6j: can be described as: 

Su^6t = S^{l + f3n'), (1) 



© 0000 RAS, MNRAS 000, 000-000 



Large-scale bias in the Universe II: redshift space bispectrum 3 



where the superscript s refers to the quantity in redshift 
space, and fi is the cosine of the angle between the fc-vector 
and the line of sight (i.e. the direction along which the distor- 
tion takes place). Throughout, we adopt the distant-observer 
approximation, so fi is independent of location within the 
survey. 

The effect on the average power spectrum in a thin shell 
in fc-space is: 



(2) 



This is an enhancement of the redshift space power on all 
scales by the same factor, 28/15 in the case where (3—1. 

The bispectrum B{k\,k2,k-i) oc (^gj^^gj^^gfes^ non- 
zero only if the 3 wavevectors form a closed triangle. For 
equilateral triangles, averaging the three factors in equation 
(^) over ^ gives 



(3) 



so that for 13=1 there is an amplification of the bispectrum 
on all scales by a factor B" /B = 96/35. 

We can estimate the effect of redshift distortions on the 
estimation of the bias parameter by noting that, in the ab- 
sence of shot noise and a non-linear bias term, h oc / B 
(MVH97). If redshift distortions are ignored, the bias pa- 
rameter is overestimated by a factor 



B^/B 



= 1.27 



(4) 



a 13 = 1. 

Virialised motion produces a radial smearing and the as- 
sociated FoG effect contaminates the wavelengths we are in- 
terested in. This is hard to treat exactly, but being a smear- 
ing effect it produces a mild damping on the power acting in 
the opposite direction to the large-scale boosting of power 
(see for example Matsubara 1994). A model was introduced 
by Peacock (1992; see also Peacock & Dodds 1994) in which 
the small scale velocity field is assumed to have an incoher- 
ent Gaussian distribution. In reality, according to evidence 
from simulations (e.g. Zurek et al. 1994) and observations 
(e.g. Marzke et al. 1995) the velocity distribution is better 
modelled by an exponential, but in practice there is very 
little difference between these two models on scale where 
the damping factor is < 2 (Ballinger, Peacock & Heavens 
1996). In practice this incoherent model fits well the power 
spectrum in numerical simulations (Hatton & Cole 1998). 

In the Fourier domain the exponential velocity disper- 
sion model gives a damping factor Diko^) (where k = \k\), 
6j:D, given by 



such that (5g 



Dika^i) = 



1 



^l + fc2aVV2 



(5) 



where a is the pairwise velocity dispersion of galaxies. The 
overall effect in fc-space can be obtained by multiplying the 
large-scale enhancing factor and the small-scale damping 
factor before averaging over fj, (Peacock & Dodds 1994): 



(1 + /3m') 

yTTfcVV/2 



(6) 



For the power spectrum in a thin shell in fc-space the overall 
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Figure 1. Scale dependence of the redshift distortions on the 
measurement of the bias parameter, which is overestimated by 
1/ordinate if the distortions are ignored. The dashed line refers to 
the case where there is no small scale velocity dispersion (cr = 0), 
the solid line is for a = 200 km s~^, the dot-dashed line is for 
cr = 400 km s"'^ and the dotted line is for a = 700 km s~^. The 
wavenumbcr k is in units of h Mpc"'^. 



effect is given by: 
P'{k) = 



/3)^ai 



V2(feV2 - 2f3f arctan(fccr/^) 1 



(7) 



Similarly for the angle- averaged bispectrum for equilat- 
eral triangles one obtains 



B'{k) = 



+ 



12k-^a^l3 + (36fc^a^ + 6k^a^)f3'^ 
5fc^s^ - 30)/3^ 



y2fc'5crV2 + fc2cr2 



3V2f3{2k'^a'^ - 6k'^a^f3 + 5(3^) arcsinh(fc(T/^) 



B{k) . 
(8) 



The overall effect on the measurement of the bias pa- 
rameter is given by the ratio [equation (4)]. In the case (3 = 1 
(that, since (3 is expected to be < 1, is the case where large- 
scale redshift distortions are most important) the scale de- 
pendence of this quantity is shown in Fig. 1. For small ve- 
locity dispersions the effect is small for a reasonable range 
of fc-vectors, but for bigger a it diverges quite quickly. Since 
we expect to use wavelengths up to fc ~ 2 (throughout the 
k unit is h Mpc~^) in order to have a good signal to noise, 
ignoring redshift distortions introduces unacceptably big er- 
rors. In the next section we illustrate a more elegant way 
to deal with this problem and to have a more satisfactory 



© 0000 RAS, MNRAS 000, 000-000 



4 



L. Verde, A.F. Heavens, S. Matarrese & L. Moscardini 





1 




1 


DQ 






1 


CM 










1 






T 


1 


X 






1 







)5 
)4 

)^ 
)2 

)1 
,0 
1 

0, 




.0 



Figure 2. Redshift distortions effects on the bispectrum. The 
bispectrum is expressed in the dimensionless form that is the 
counterpart for the bispectrum of for the power spectrum: 
X'^ = (2/7r'^)^fc^B, where k is the smallest wavenumber in the 
triangle. The dot-dashed line is the 20PT-real space bispectrum, 
the dotted line is the redshift-space bispectrum obtained taking 
into account only the large scale boosting effect, the dashed line 
is the 20PT bispectrum with the small scale smoothing effect 
included as in equation (8). The velocity dispersion used here 
((T = 650 km s~^) is the one that gives the best fit to the redshift- 
space power spectrum up to fc ~ 0.8. Finally the thick solid line 
with errorbars is the measured bispectrum from the redshift-space 
map. The agreement up to fc ~ 0.7 should be compared with 
the corresponding breakdown of 20PT in real-space (fe = 0.55; 
MVH97). As before k is in units of h Mpc"!. 



result: the redshift distortions should be modelled for con- 
sistency to second order using perturbation theory and the 
small scale effect cannot be neglected. 

The redshift-space effect on the bispectrum from equi- 
lateral triangles is illustrated in Fig. 2, which shows re- 
sults from an N-body simulation by the Hydra consortium 
(Couchman et al. 1995). It is a cold dark matter (CDM) sim- 
ulation with Q.(i = 1, a. shape parameter F = 0.25, erg = 0.64, 
no cosmological constant and no biasing. 



2.2 Second-order perturbation description for the 
effect of redshift distortions 

In the framework of second order perturbation theory 
(20PT) for the density field, for consistency also redshift 
distortions should be treated to second order. Unfortunately 
highly non-linear effects contaminate scales where 20PT 
should hold. This can be easily understood by considering 
how a spherical over-density appears distorted by peculiar 
velocities when observed in redshift space (in the small angle 
approximation). At large scales (linear regime) the collaps- 
ing shells appear squashed, the squashing increases until the 
infall velocity cancels with the Hubble expansion. At this 
point, where the dynamics is only mildly non-linear, the re- 
gion appears to be collapsed on a sheet of infinite density 
in redshift space. At smaller scales the collapsing regions 



and the already virialised (highly non-linear) regions appear 
elongated along the line of sight (FoG). ^From equation (^ 
it is clear that the damping factor acts as a filter on the 
Fourier components of the density field. Therefore our anal- 
ysis involves 20PT of the density field and of the redshift 
distortions combined with the exponential model for small 
scale velocity dispersions acting as a smoothing filter. 

2.3 Large-scale second order model 

We assume a local correspondence between the galaxy over- 
density field 5g and the underlying mass density field 5 = 
Sp/p, and make a Taylor expansion in 5: 



(9) 



retaining terms to i = 2, but ignoring i = as it contributes 
only to fc = 0. 

The bispectrum B is defined by the 3-point function in 
Fourier space: 

(^sE/sfe/pfea) = (2^)'B(fci,fc2,fc3)5°(fci+fc2 + fc3) , (10) 

where is a 3D Dirac delta function. The expression for B 
is given in MVH97 for & in real space. In redshift space, the 
bispectrum is inevitably more complicated (Heavens et al., 
in preparation): 

B{ki,k2,k3) = 
6?(l+M?/3)(l+M2/3) 

{ [2P(fei)P(fc2)Ker(fci , fe, , M2 , A*) + eye] 
+ &2 [P{ki)P{k2) + eye.]} , (11) 
where the cyclic terms are (1,3) and (2,3), and 

fcs = — fci — k2 , 
r- ki 



rki 



(12) 



The kernel is given by 

Ker(fci,fc2,/3,Ati,^2,M) = 

J{ki,k2)bi + ^j.^f3hK^''\ki,k2) + ^^l^i|3^bi + 

u2n u2f 



2 ^ 

J.2 «2 



, 2 , 2-. , ,kx k2, 

2 (^^+^^) + — '^^'^^(fe^+fcT' 



— M1M2(M2^+Ml^), 

where the real-space kernel is (e.g. Catelan 
1994a) 



J(fcl,fc2) = 

5 aTi ■ k2 , , 2 

7 2fcifc2 Vfo fcJ 7 



fci ■ k2 

kik2 



(13) 
Moscardini 



(14) 



and if'^' comes from the velocity field (e.g. Catelan & 
Moscardini 1994b) 

K^^\ki,k2) = 

2 



3 k\ ■ k2 
7 ^ 



fci ^ fc2 

2fcifc2 \k2 ki 



+ 



ki ■ ki 



7 V kik2 



(15) 
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In real space the 20PT bispectrum is very weakly de- 
pendent on Qo (e.g. MVH97). In redshift space, there is a 
strong f2o-dependence, but only via /3, which can be mea- 
sured from the power spectrum. The residual dependence 
on Qo is again very weak, and we therefore quote the results 
for f^o — 1. 

We wish to write B in terms of observable quantities, 
which P is not. To the order which we are working, we may 
write P — Pg/h\. There are one-loop corrections to this 
which may be significant if &2 and 63 are not close to zero 
(Heavens et al. in preparation). In Section 4 we investigate 
the practical effects of those corrections using biased simu- 
lations, but consistently with perturbation theory, in terms 
of the real-space galaxy power spectrum Pg = we can 
write: 



{5 r 5 T 5 r Y 

\ gki gk2 gk^l 

where 



GiPg, 13, Cl, C2, fci, ^2,^3) 



(16) 



Q{Pg,P,ci,C2,ki,k2,kz) = 
(2^)^(1 +/3/x?)(l+/3/.i) 

{ci [2Pg{ki)Pg(k2)Ker{ki, k2, P, fii, fi2, ^i) + eye. 
+C2 [Pg{ki)Pg{k2) + eye] } + k2 + fca) , 



(17) 



where the parameters we wish to extract 61, 62 appear in 
the combinations 



Cl = — 



C2= To 



(18) 



It is interesting to notice here that the degeneracy between 
Jlo and b can be lifted because equation (131), as explained 
before, is dependent on flo mostly through the measur- 
able parameter /3 and is quite insensitive to A (see also 
Bernardeau 1994; Eisenstein 1997); the error introduced by 
neglecting the Q- and A-dependence of the bispectrum is 
much smaller that the final expected error in the determi- 
nation of the bias parameter, and can therefore be safely ig- 
nored for this purpose. Redshift-space distortions may also 
be introduced by a separate effect, arising from an incor- 
rect geometry being used to create the redshift-space map 
(Phillips 1994; Ballinger et al. 1996). This effect should be 
detectable with SDSS and corrected for. This accounts for 
the large-scale effects via perturbation theory. We now turn 
to the effects of small-scale velocities. 



2.4 Small scale model 

It is worth noticing that the model of small-scale velocity 
dispersion is simplistic because it takes no account of the 
fact that the velocity dispersion is correlated with the den- 
sity field. This means that the dispersion is higher in high 
density regions such as galaxy clusters. The value cr used 
in the model (equation 6) is therefore only an "effective" 
velocity dispersion which depends on how galaxies populate 
the clusters and on the bias parameter b (Fisher 1995). In 
principle, the form of the filtering may be unconnected with 
the small-scale velocity distribution, because the modelling 
assumes that the velocity is uncorrelated with the density, 
which is untrue in detail. In practice, then, one should per- 
haps treat the filter function as a heuristic object whose 



parameter a is determined from the large-scale data, by fit- 
ting the power spectrum. A Taylor expansion as A: ^ will 
demand a quadratic form for _D ~ 1 — k^a^fi^ /A + 0(k'^), 
so one might expect the model to be generally good pro- 
vided k^a^/i^ is small. We have found this to be the case for 
the power spectrum; in high-f2o models, imposing a limit of 
a — k^/i^ = 0.3 works well (for a CDM model with Q,o ~ 1, 
(78 = 0.64 and T = 0.25). 

The small scale damping effect acts as a smoothing filter 
(see equation ^) on the non-linear field. Following the gen- 
eral mathematical method outlined in Section 3 of MVH97 
for calculating A^-point distributions in Fourier space it is 
possible to obtain an expression for the galaxy N-point spec- 
tra in redshift space that includes the large scale effect to 
second order in perturbation theory (equation ^l|) and the 
small-scale effect due to the velocity dispersion of the galax- 
ies (equation The technical derivation can be found in 
Appendix A. For A'^ = 3, as a special case we obtain the final 
expression for the galaxy bispectrum in redshift space for a 
given triangle configuration, that also includes small scale 
velocity dispersion: 



Q{Pg,f3,ci,C2,ki,k2,kz) 



^(l + fc?(7V?/2) (l + fciaVi/2) (1- 



fc|aVi/2) 



(19) 

where Q is given by equation (|l7|). 

This is also the key equation of this paper, generalising 
MVH97 to redshift space galaxy catalogues. Note that we 
assume that j3 and a (and hence Pg{k) in real space) are 
computed directly from the redshift-space power spectrum 
[Pg{k)], SO they are not parameters to be determined from 
the bispectrum (see section 3.1). 

In order to be able to perform the likelihood analysis 
the covariance matrix for the bispectrum has to be consis- 
tently modified. The covariance matrix for the bispectrum 
involves the expression for the six-point function as shown 
in equations (15) and (16) of MVH97. 

Referring to equations (38) to (42) of MVH97, we wiU 
quote here how to modify the equations when working in 
redshift space: 



p(fc) __p(fc) (1+^^2)2 



(20) 



(in the power spectrum only, not the bispectrum below); 

(Sgk.^kjgk,) — G{Po,P,Cl,C2,kl,k2,k,) . (21) 

And finally the covariance matrix Ca/s obtained with 
this prescription needs to be filtered: 



nti(i+fc^v?/2) 



(22) 



where the index i runs over the six vectors that form the 
two triangles v\ and U2. 



3 TESTS ON N-BODY SIMULATIONS 

We have tested the model (equations |l^ & by perform- 
ing a likelihood analysis of the bispectrum on an unbiased 
redshift-space catalogue created from an N-body simulation 
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provided by the Hydra consortium (Couchman et al. 1995) . 
This is a CDM simulation with f2o = 1, (Jg = 0.64 and 
r = 0.25; these parameters have been chosen to match 
closely the present day galaxy power spectrum. It is clear 
from equations (0), (|l|) and (|l|) that in a realistic ap- 
plication we need to know (or have an accurate fit for) the 
real space galaxy power spectrum, because it is this quantity 
that is required in the model and, knowing it, subsequently 
we can attempt to evaluate a. 

Therefore, as a first step, we need to know if we are 
able to reconstruct the real-space galaxy power spectrum. 
The problem is greatly complicated by the fact that the 
method is applied on individual subvolumes, rather that on 
the overall volume. Even assuming that the data will be 
good enough to enable us to reconstruct the real-space power 
spectrum for the whole volume of the survey, some care is 
needed in extracting then the real space spectrum for each 
subsample. A subsequent paper will focus on this issue, in 
the present paper we shall assume that we have an accurate 
enough fit for the real space power spectrum. In Appendix 
B we give some limits on the accuracy needed. 

3.1 Limit of validity of the small scale redshift 
distortion model 

As mentioned in Section 2.4, the value a used in the model 
is only an "effective" velocity dispersion, so it can actually 
be seen as a parameter whose value is fixed by the condition 
that equation (6), when applied to the real space Fourier 
modes, gives the observed redshift space power spectrum. 
The limit of validity of the model, i.e. the maximum value 
of a = k^n'^ , can be set as follows. 

Since we can assume, as seen in Section 2.4, that the 
real-space power spectrum is known, a likelihood analysis of 
the redshift space Fourier modes should give the two param- 
eters p and a. 

The probability distributions of the real and imaginary 
parts of are Gaussian in linear theory, but the second or- 
der correction induces skewness. However in this regime the 
contribution from the second order correction is relatively 
small, and in any case, the Central Limit Theorem ensures 
that the average of a set of modes from a region in fc— space 
will tend to a gaussian for a large number of modes, and 
the likelihood for the set is then equivalent to a product of 
individual gaussians, since homogeneity alone ensures that 
the modes are uncorrelated. Therefore, the likelihood tends 
to a product of Gaussians with zero means and dispersion 
A/P=(fc, /i)/2, where 

P"(fc,M) = P{k){l + (3fi'^ fD{kafif . (23) 

The combined likelihood for /3 and a 
CiP,a) = 

i g ^ fp^Ifc^To 

(27r)W2n„(iP»(fc,M)) 

(24) 

The parameter l3 in the simulation is known, but also in a 
realistic application will be known through an independent 
method (see the Introduction), therefore we can say that 
the modelling for the small scale velocity dispersion effect 



breaks down where the likelihood fails to recover the "true" 
value for /3. On large scales the model is expected to be 
valid for any value of the factor a = k^H^, but on smaller 
scales a must be constrained. We find no constraint on a 
up to fc = 0.70 then a < 0.3 up to fc = 0.9. For higher k 
it becomes difficult to have a constraint on both /3 and cr 
for any value of a. This feature is a direct consequence of 
an effect noted also by Bromley, Warren & Zurek (1997): at 
short wavelengths the fi dependence is cancelling. However 
for k > 0.9 a can still be recovered with less than 5 per cent 
uncertainty keeping a < 0.3 and constraining /? to assume 
the value previously determined. 

We notice a slight increase of the value for a with k. This 
would suggest that the exponential model is not exact, but, 
as we already pointed out, a can be treated as an empirical 
parameter. 



3.2 Obtaining the effective velocity dispersion in 
the subvolumes 

In the individual subvolumes the power spectrum becomes 
noisier, and the large-scale modes are not well sampled. As- 
suming that the local real space power spectrum is known, 
we can determine the local a, because the local value for /3 
can be extrapolated from the global one as follows: if Ngy 
is the number of particles in the subvolume in redshift space 
and A'tot is the number of particles in the whole volume of 
the sample 

Aocal = /3global ( ^.^^ ^ . (25) 

V iVtot / 

There is a subtlety to point out here: in principle we 
would need to use the number of particles in the real space 
subvolume, but this is not known. However if the subsamples 
are not too small it should not make too much difference due 
also to the degeneracy that arises on those scales between /3 
and a. Moreover an error of 20 per cent in the mean den- 
sity (that is the biggest fluctuation we observe in splitting 
our simulation of WOh~^ Mpc side in subsamples of 50h~^ 
Mpc side) leads to an error of 5 per cent on /3iocai that is of 
the same order of the uncertainty on a obtained from the 
likelihood. 

Since we noticed a slight increase on the value for a 
with k, the value for a has been obtained independently for 
different intervals of k. 

We performed a likelihood analysis as described above, 
with /3 fixed, to recover a for each subvolume, in bands of k 
limited by the following values: 0.3, 0.7, 0.85, 1.0, 1.2, 1.3, 
with an additional band 1.7 — 2.6. 

The reason for this choice of intervals will be evident in 
the next section. 



4 LIKELIHOOD ANALYSIS 

It is evident from ( p^ ) that a set of triangles in fc-space allows 
the measurement of the bias parameters bi and 62 through 
a likelihood analysis. It is however necessary to consider tri- 
angles of different shapes in order to lift a new degeneracy 
between 61 and 62 since a set of triangles of the same shape 
gives almost degenerate information on 61 and 62 while for 



© 0000 RAS, MNRAS 000, 000-000 



Large-scale bias in the Universe II: redshift space bispectrum 7 



the equilateral triangles the degeneracy is complete. The op- 
timal triangle configurations are not easy to determine, but 
it is particularly efi^ective to consider, along with equilat- 
eral triangles, a configuration consisting of a repeated vector 
and one of twice the amplitude and opposite direction (re- 
ferred to as 'degenerate'). By ensuring that any k appears in 
only one triangle of either shape (and —k does not appear 
elsewhere either) the data are uncorrelated (MVH97): the 
covariance matrix is diagonal and the likelihoods for differ- 
ent shapes can be multiplied. This simplifies the likelihood 
analysis considerably. 

4.1 Choice of fc- vector limits 

4-1.1 r^o = 1 Universe 

Due to the smearing effect of small scale velocities the range 
of validity of 20PT is expected to be slightly extended. This 
is exactly what we observe. For equilateral configurations 
for example, from Fig. 2 it is clear that the redshift-space 
bispectrum for equilateral triangles is well described by our 
model up to ~ 0.7 - 0.9. 

In real space the fc-vector range we used for the like- 
lihood analysis of the bispectrum was: 0.3 < k < 0.55 for 
equilateral configurations, and 0.55 < fcshort < 1-1 for de- 
generate. In the redshift-space likelihood analysis for the 
bispectrum the range of fc-vector used changes: for equi- 
lateral configurations 0.3 < k < 0.85 and, for degenerate 
configuration, 0.85 < fcshort < 1-3. 

4-1.2 Low-^lo Universe 

In real-space the second-order bispectrum is only weakly de- 
pendent on r^o (e.g. MVH97), in redshift-space there is an 
additional dependence on Ho, but this is only through the 
parameter j3 that can be measured (Heavens et al. in prepa- 
ration). Although the remaining dependence on Slo is very 
weak, in a real survey we need to decide a priori where to 
place the upper cutoff in wavenumber. Since, in principle, 
this will be dependent on Q.o, we investigate a low-density 
numerical simulation to see how robust the cutoff wavenum- 
ber is. The simulation, again from the Hydra Consortium, 
is for a CDM model with parameters ^0=0. 3, A = 0, 
r=0.25, and a box-size of 100 Mpc. Fig. 3 shows how 
the real space likelihood contour for ci and C2 varies with 
the cutoff wavenumber for this simulation. This final output 
has o-g = 1.06, which makes it unsuitable for our redshift- 
space analysis. In fact the real space analysis recovers the 
true value for the bias within the errors (ci = 0.9 ± 0.5, 
C2 = 0.1 ± 0.7) but the first mildly non-linear wavelengths 
are larger than the box and the redshift-space distortions 
are too non-linear for the wavelengths we need to use. We 
have therefore analysed two earlier epochs, with erg = 0.64 
(when Q.0 = 0.5) and erg = 0.81 (when f^o = 0.41). 

For the redshift-space analysis in these cases, since there 
is a strong covariance between /3 and a, it was not always 
possible for all the fc-vector bands to follow the procedure 
outlined in Section 3 to set the values for the velocity disper- 
sion parameter a and the angle a. When it was not possible 
to close the likelihood contours for /3 and (t, we assumed 
P was known to set the value for a and we used for a the 
same constraint as in the other cases. We also find that the 



method for the redshift space analysis recovers the true value 
of the bias parameter within the errors. The errors are still 
big (100 per cent), but the non-linear scale is also quite big 
(fc < 0.3) and we can reduce the errors by analyzing subvol- 
umes (MVH97). A rough estimate for the number of volumes 
that one can obtain from the SDSS (about a thousand if lin- 
ear theory breaks down at fcni — 0.1, about eight thousand if 
fcni — 0.3) suggests that an accuracy of few percent or better 
should be achieved. 



4.1.3 General case 

The range of validity of second order perturbation theory de- 
pends on the model of the Universe, so we therefore need to 
be able to determine, in a consistent manner, where 20PT 
breaks down prior to estimating fio- For all the simula- 
tions we analyzed the breakdown occurs where the real-space 
normalized bispectrum = {2/Tx^)k^ B (analogous to the 
quantity for the power spectrum) is ~ 10 for equi- 
lateral triangles, and ~ 100 for degenerate triangles. This 
real-space quantity can be estimated, assuming, as we do, 
that (3 and the small-scale velocity dispersion a can be esti- 
mated from the redshift-space power spectrum. For the two 
earlier epochs of the low-f2o simulation we also changed erg 
by reinterpreting the box size (e.g. Mann, Peacock & Heav- 
ens 1997): also in these cases the breakdown occurs for the 
same values of the normalized bispectrum x^- 



4.2 Biased catalogue 

In order to create a biased catalogue, we applied the stan- 
dard friends-of-friends algorithm to the same fio = 1 N-body 
simulation used in MVH97 and in Section 2.1. We adopted a 
linking parameter equal to 15 per cent of the mean interpar- 
ticle distance, i.e. approximately 0.12/i~^ Mpc. The biased 
catalogue is defined as the list of all the groups of particles 
with at least three members; the corresponding number den- 
sity is 5.6 X lQ~^h? Mpc~^. Although this density is more 
realistic than that of the unbiased catalogue (2.1 Mpc"'') 
used in MVH97 and in the redshift-space analysis, this is still 
about six times larger than the number density that future 
galaxy surveys will have, but with this choice shot noise does 
not dominate the signal in the regime where 20PT should 
hold, allowing a determination of the breakdown of 20PT. 

This biasing scheme is not necessarily equivalent to the 
high peaks biasing scheme nor can necessarily be analyti- 
cally expressed as in equation (^). However for the purpose 
of recovering Q.q from the /3 parameter what is necessary is 
the "effective bias" haS defined as: Pg = b^sP- In the biased 
catalogue fccff ~ 1-1 on scale where 20PT holds. 

Also in presence of biasing the breakdown of 20PT can 
easily be detected. Fig. 4 shows how the joint likelihood con- 
tours vary for fc- vectors of the degenerate triangle configura- 
tion belonging to different independent shells in fc-space. It is 
clear that 20PT breaks down at fc ~ 0.8. When perturbation 
theory breaks down the likelihood does not drift in a precise 
direction as it happens for the unbiased case. This can be due 
to the superposition of the non-linear clustering (that drifts 
the likelihood towards high ci) and the biasing acting in the 
opposite direction. For the biased case the fc-vector limits 
are for the equilateral configuration 0.3 < k < 0.5 and for 



© 0000 RAS, MNRAS 000, 000-000 



L. Verde, A.F. Heavens, S. Matarrese & L. Moscardini 




2.0 




2.0 




2.0 



Figure 3. The real space joint likelihood for ci and C2 as a function of the cutoff wavenumber employed, for a low-density CDM 
simulation with shape parameter F = 0.25 and ag = 1.06. The dotted and solid line contours contain 90 and 63 percent of the a 
posteriori probability P(ci,C2 | DATA) assuming uniform prior. The wavenumber corresponding to A'^{k) = 1 is fc = 0.17. Second-order 
perturbation theory works well for the degenerate configuration up to fcshort — 0.66; this corresponds to x'' — 100. This figure is relative 
to the final output (eg = 1.06) of the low-f2o simulation. 
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Figure 4. The real space joint likelihood for ci and C2 for the biased catalogue. The dotted and solid line contours contain 90 and 63 
percent of the a posteriori probability. In each panel fc- vectors of the degenerate triangle configurations belong to a different shell in k 
space. For the equilateral configuration the fc-vectors limits are 0.3 < A: < 0.5 and for the degenerate the six shells are limited by: 0.62, 
0.7, 0.8, 0.9, 1.0, 1.1. The first interval has been chosen in order to have about the same number of data as in the following interval and 
a signal of comparable strength. Second-order perturbation theory works well for the degenerate configuration up to fcgj^Qj-^ ~ 0.8 this 
corresponds to — 100. The vertical dashed line shows the inverse of the effective bias of the power spectrum, y^Pg/P at fe = 0.8. 



degenerate configuration 0.5 < fcshort < 0.8. The breakdown 
occurs wliere tiie normalized bispectrum x'^ ~ {2/Tv'^)k^' B is 
X'^ — 10 for equilateral triangles and x^ — 100 for degenerate 
triangles. 

We also checked the behaviour of the 20PT validity 



range for a linearly biased catalogue. The catalogue has been 
created by applying equation (|9|) with foi = 1.4, bi = {i > 
2), to the density contrast field 5{x) of the f^o = 1, unbiased 
simulation. In this case 20PT breaks down for the same k- 
vectors values as in the unbiased case therefore where x^ 
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Figure 5. Joint lilceliliood of ci = l/bi and C2 = 62/61 , for a 
CDM N-body simulation (f2o = I, as = 0.64, T = 0.25, see text 
for further details), but in redshift space. Contours contain 68.3 
and 90 per cent of the a posteriori probability P(ci, C2 | DATA) 
assuming uniform priors for ci and C2 . 



Figure 7. Joint likelihood of ci = I/61 and C2 = 62/61, for the 
biased simulation in redshift space. The effective bias on scales 
where 20PT holds is 6eff=l.l (l/6eff = 0.91) as indicated by 
the dashed line. Contours contain 68.3 and 90 per cent of the a 
posteriori probability. 
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Figure 6. Joint likelihood of ci = I/61 and C2 = 62 /6j, for 
the real space N-body simulation (see text for details), but bi- 
ased. The effective bias on scales where 20PT holds is 6off =l.l 
(l/6efi = 0.91) as indicated by the dashed line. Contours contain 
68.3 and 90 per cent of the a posteriori probability. 



ruination of the velocity dispersion is more noisy. In fact 
the adopted biasing scheme considerably increases the shot 
noise, and this propagates in a noisy determination of the 
velocity dispersion especially if the volume of the simulation 
is limited to a box of 100 h~^ Mpc side. When applying the 
method to a real survey the volume available will be much 
larger, allowing a more precise determination of the velocity 
dispersion. 



4.3 Results 

For the flat Universe simulation in redshift space the joint 
likelihood for ci and C2 for the limits on k set as described 
above, is shown in Fig. 5. The true value (ci = l/6i = l, 
C2 = 62/61=0) is remarkably recovered within the errors: 
1.02 ± 0.15 for ci and -0.05 ± 0.22 for C2. 

The joint likelihood computed in the biased simulation 
for ci and C2 for the limits on k set as in Section 4.2 is shown 
in Fig. 6 and Fig. 7, in real and redshift space respectively. 
Marginalising over C2, the true value for the effective bias 
6eff = 1.1, Ccff = l/6off is well recovered through ci: we 
obtain ci = 0.93 ± 0.15, C2 = 0.15 ± 0.25 for the real space 
case and ci = 0.90 ± 0.22, C2 = 0.15 ± 0.35 for the redshift 
space case. 



is larger than the values previously dotorminod. However, 
unless there is anti-bias, the limit sets a conservative k- 
vector cutoff. 

Finally we performed the likelihood analysis on the bi- 
ased simulation with beff = 1.1 in redshift space. Also in 
this more realistic case the analysis recovers the true value 
for the bias parameter within the errors (see Fig. 7). In this 
case the errors in the determination of the bias parameter 
are slightly bigger; this is due to the fact that the deter- 



5 DISCUSSION 

In Matarrese at al. (1997) we presented the first steps to- 
wards the goal of using the bispectrum to measure the lin- 
ear bias parameter b from an ideal real-space galaxy dis- 
tribution. The aim of this project is ultimately to combine 
measurements of b with estimates of /3 = Qq'^ /b from linear 
redshift distortion studies to get an estimate of the density 
parameter fio- In this paper, we have mainly tackled the 
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problem of redshift-space distortions, where peculiar veloc- 
ities distort the three-dimensional map in redshift surveys. 
We have shown how the combination of second-order pertur- 
bation theory and an incoherent velocity dispersion model 
for virialised structures can successfully be used to estimate 
the bias parameter. This was not a trivial issue, since the sig- 
nal for bias comes largely from the mildly non- linear regime, 
which is significantly affected by non-perturbative consider- 
ations. 

We have also analyzed a biased simulation in real and 
redshift space: the bispectrum method successfully recovers 
the value for the bias parameter not only if we adopt a linear 
bias scheme, but also in a more realistic case where groups 
of dark matter particles are identified and each group counts 
as a galaxy. 

To turn this into a practical application, one needs to 
be able to determine, in a consistent manner, where second- 
order perturbation theory breaks down. For the simulations 
we have analysed, the breakdown occurs when the real-space 
normalised bispectrum (analogous to the quantity for 
the power spectrum) is — 10 for equilateral triangles, and 
100 for degenerate triangles. This real-space quantity can be 
estimated, assuming, as we do, that /3 and the small-scale 
velocity dispersion a can be estimated from the redshift- 
space power spectrum. 

The practical limitations to the accuracy of the determi- 
nation of b will be the size of the redshift survey, and the den- 
sity of objects. Our suggestion for practical implementation 
is to divide the survey into a number of subsamples, within 
each of which we apply the distant-observer approximation 
and make an estimate of the bias parameter (marginalising 
over the second-order term C2). The size of the subsamples is 
determined by the non-linear scale, k ~ 0.3 if & ~ 1, restrict- 
ing the subsample sizes to about 20 Mpc or larger. Since 
the signal comes from the weakly non-linear regime, the es- 
timates may be assumed to be essentially uncorrelated. For 
the most distant subsamples, shot noise will prevent any use- 
ful information being obtained. This will restrict the depth 
to around z = 0.2 for the forthcoming Anglo- Australian 
2dF and SDSS. Nevertheless this gives a very large number 
(about 1800 and about 8000 respectively) of useful subvol- 
umes (however, if the linear theory breaks down at kni — 0.1 
the number of volume that one can obtain from the SDSS 
drops to about a thousand). Combining this with the esti- 
mated error on b from a single subvolume of about 50 
Mpc suggests that errors of about 5 per cent or less should 
be achievable. The statistical error on /3 from these surveys 
should be comparable or smaller (e.g. Ballinger et al. 1996; 
but see also Hatton & Cole 1998, Bromley et al. 1997), so 
the error on ilo will be mainly determined by the error on b 
and should be around few percent. 
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APPENDIX A: EXPRESSION FOR THE 
N-POINT GALAXY SPECTRA IN REDSHIFT 
SPACE 

Following the mathematical method outlined in Section 3 of 
MVH97 it is possible to calculate the galaxy N-point spectra 
in redshift space. The expression for the N-point spectra 
includes the large scale effect to second order in perturbation 
theory (equation ^l|) and the small scale effect due to the 
velocity dispersion of the galaxies with velocity distribution 
modelled by an exponential (equation T 



spectra are negligible, but it may not be true if the subsam- 
ples are too small and the spectra become noisy or if the 
real space power reconstruction procedure yields the power 
spectrum with uncertainties. 

However it is possible to calculate the entity of the er- 
ror on the bias parameter due to uncertainty in the power 
spectrum as follows. We ignore shot noise for simplicity and 
we refer to equilateral triangles. Since the likelihood adds 
information inversely weighted by variance, the error on ci 
o"ci is given by 



by: 



The expression for the external source J'l^ is still given 1 



J^(x) = -iN { exp 



,(A1) 



but now the Fourier transform of the smoothing function 
(W) includes the effect of the filter: 

1 



(A2) 



The Ansatz for the generating functional is still the same: as- 
suming Gaussian initial conditions and allowing for a quasi- 
linear evolution in 20PT approximation all the irreducible 
correlation functions ^conn. of order n > 3 are negligible. 
But the difference is that in the expression for the generat- 
ing functional 



APPENDIX B: CONSTRAINTS ON THE 
NUMBER OF SUBSAMPLES 

In MVH97 we showed that the signal to noise of the bias 
estimate can be improved by splitting the volume in sub- 
samples. This is certainly true if the errors on the power 



^—^ (Tcj (shell) 

shells 



(Bl) 



(Tci (shell) being the contribution to the error on ci from a 
thin shell of radius k in fc-space. 

This can be expressed in terms of the error on the power 
spectrum. Let P(k) be the average power in the shell in k- 
space; P(k) is constant in the shell and has a variance Upf^i^y 
Therefore 



(j^j (shell) 



2 



4(7 



P(fc) 



P(k)2 



where u labels the triangles. 

Summing the contributions from all the shells. 



2 



(B2) 



(B3) 



Z[Jk] = 

exp li J d'xMx) - i / d^ard^x' Jfc(f) J-fe(f')fe'(5, x') (A3) 
-| / d3a;d3x'd3x''Jfc(x)Jfc(f')^fe(^'')ecoJ>'n':^(^,^',5?'')] , 

now ^conn'^'' and f^"** includes the redshift space effects that 
yield equations ( |ll[ ) to (|l^) (Heavens et al., in preparation). 

With these modifications the generating functional ap- 
proach outlined in MVH97 allows calculation of A'^-point 
spectra and their covariance properties directly in redshift- 
space taking also into account non-linearities to second order 
in perturbation theory. The advantages are manifold since 
non-linearities affect the statistics of the density field even 
on quite large scales and, at the same time, there is the ne- 
cessity to push the analysis to smaller scales to improve the 
statistics. 

The general prescription turns out to be that for a 
given configuration of {fci}i=i...jv the A''-point spectrum 

. . . (5^^ ) is first calculated consistently with 20PT for 
the density field and the redshift distortion, and then has to 
be 'filtered' with A*' damping factors as in equation (j^ each 
one relative to one . 

The galaxy bispectrum in redshift space for a given tri- 
angle configuration is a special case where N = 3 and it is 
given in equation (|l9|). 



and passing to the continuum limit, under the hypothesis 
that P is affected only by statistical error, we obtain 



2{27r)- 



■ r 

Jk, 



K^dk 



1. 
V 



(B4) 



where V is the volume of the sample in units of h~^ Mpc^. If 
fcmax is the same as in real space (fcmax previously found was 
0.55), we have — 53/VV. As we have seen, the redshift- 
space fcmax can be pushed towards higher k, reducing the 
error on ci: if fcmax = 0.85 as we found here, then — 
28/VV 

Therefore splitting the volume in M subvolumes is still 
effective if 



M Vsv 



7 

Kot 



(B5) 



where (B) is the error on ci due to the variance in the 
bispectrum, Vsv = Vtot/M is the volume of the subsamples. 
This is a constraint on the number of subsample to make, 
once the properties of the selection function of the survey 
and the range of validity of 20PT are known. 

As long as the errors on the reconstructed real-space 
power spectrum are of the same order of its statistical er- 
rors, and if still, as we found in the simulation, a^^ (B) ~ 0.5 
and the optimal size for the subvolumes is ~ 20h~^ Mpc 
(MVH97), then the use of the subvolumes is still effective 
both for SDSS and 2dF surveys. Any additional uncertainty 
on the power spectrum will alter the right-hand side of equa- 
tion (B5) placing a stronger constraint on the number of 
subvolumes and/or on their size. 
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